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ABSTRACT 

We investigate the effects on cosmological clustering statistics of empirical biasing, 
where the galaxy distribution is a local transformation of the present-day Eulerian 
density field. The effects of the suppression of galaxy numbers in voids, and their 
enhancement in regions of high density, are considered, independently and in combi- 
nation. We compare results from numerical simulations with the predictions of simple 
analytic models. We find that the bias is generally scale-dependent, so that the shape 
of the galaxy power spectrum differs from that of the underlying mass distribution. 
The degree of bias is always a monotonic function of scale, tending to an asymptotic 
value on scales where the density fluctuations are linear. The scale dependence is of- 
ten rather weak, with many reasonable prescriptions giving a bias which is nearly 
independent of scale. 

We have investigated whether such an Eulerian bias can reconcile a range of theo- 
retical power spectra with the twin requirements of fitting the galaxy power spectrum 
and reproducing the observed mass-to-light ratios in clusters. It is not possible to 
satisfy these constraints for any member of the family of CDM-like power spectra in 
an Einstein - de Sitter universe when normalised to match COBE on large scales and 
galaxy cluster abundances on intermediate scales. We discuss what modifications of 
the mass power spectrum might produce agreement with the observational data. 

Key words: Galaxies: clustering; cosmology: large-scale structure of Universe. 
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1 INTRODUCTION 

The great advances in observational cosmology during the 
past decade have provided a wealth of new data on galaxy 
clustering. These results facilitate a phenomenological ap- 
proach to the study of large-scale structure, in which the 
mass power spectrum is reconstructed directly from the 
observed clustering statistics (e.g. Peacock & Dodds 1994, 
PD94). This method is complementary to the hitherto con- 
ventional technique of judging a pet cosmogony on the basis 
of its predictions for a series of observationally-determined 
quantities, and has some advantages so long as no specific 
model appears capable of accounting for the full set of ob- 
servational data. 

There is, however, a fundamental problem with both the 
empirical and the a priori approaches to large-scale struc- 
ture. The linear mass power spectrum that one deduces from 
the observed clustering depends on, and may be very sen- 
sitive to, an assumed relationship between the distributions 
of mass and galaxies in the Universe, and ignorance of the 
detailed processes through which galaxies form leaves this 
relationship poorly constrained. Different classes of galaxy 



are observed to cluster differently (see, e.g., the compila- 
tion by PD94), implying that at least some galaxies are bi- 
ased and do not directly trace the mass distribution. Bias 
is also required by advocates of an Einstein - de Sitter uni- 
verse, to reconcile observed cluster M/ L values that imply 
f2 ~ 0.2 with the assumed critical density. In the absence 
of a complete understanding of galaxy formation and evolu- 
tion, large-scale structure theorists are left to model bias in 
as plausible a manner as they can. 

Many such attempts have used the high-peak bias 
method (Davis et al. 1985; Bardeen et al. 1986, BBKS). We 
will term this a Lagrangian bias scheme, since it identifies 
the sites of nascent objects (galaxies or clusters) with peaks 
in the initial density field, smoothed on a scale appropri- 
ate to the mass of the objects under consideration. If the 
initial density field is Gaussian, then these peaks are more 
strongly clustered that the mass distribution as a whole, 
producing the desired bias. iV-body simulations (e.g. Katz, 
Quinn & Gelb 1993) have revealed a poor correspondence 
between the particles found in galaxy halos selected at late 
times and those located at appropriate peaks in the initial 
density field, thus casting doubt on Lagrangian galaxy bias 



© 0000 RAS 



2 R.G. Mann, J. A. Peacock and A.F. Heavens 



prescriptions (although the work of Mo & White 1996 and 
Mo, Jing & White 1996 suggests Lagrangian bias may work 
statistically): this is not surprising, since the density field 
smoothed on the scale of galaxies is highly non-linear. For 
cluster-sized halos, however, Lagrangian bias may still be 
appropriate (Kaiser 1984; Cole & Kaiser 1989; Mann, Heav- 
ens & Peacock 1993), since their spatial distribution has 
undergone far less dynamical evolution. 

In reality, galaxy formation must be more complex than 
this, involving a range of feedback mechanisms (Dekel & 
Rees 1987; Babul & White 1991). One might hope that all 
these issues would eventually be clarified by large numeri- 
cal simulations which include a hydrodynamical treatment 
of baryons, as well as dissipationlcss dark matter, but such 
codes are in their infancy. An intriguing early result from one 
such code was the observation by Cen & Ostriker (1992; CO) 
of a tight correlation between the present-day (Eulerian) 
density field, pm, and the local number density, pg, of galax- 
ies, formed using what those authors termed a 'heuristic but 
plausible' prescription, which creates a dissipationlcss proto- 
galactic particle wherever the local baryonic component is 
sufficiently dense, rapidly cooling, contracting and Jeans un- 
stable. Together with the observed uniformity of A'l / L val- 
ues as a function of scale, these results motivate the general 
idea of Eulerian bias models, in which the galaxy number 
density at redshift zero is a function of the local value of 
the evolved mass density field. Some general aspects of such 
prescriptions were considered by Coles (1993), who deduced 
various important inequalities relating the mass and galaxy 
correlation functions in these models. 

Our approach here is as follows. We make no assump- 
tions concerning the physical processes that lead to bias, 
simply taking their overall effect to be a local transforma- 
tion of the Eulerian density field, the principal features of 
which are the suppression of galaxy numbers in low-density 
regions and their enhancement in regions of high density. 
In Section 2 we outline the set of bias prescriptions com- 
bining these components which are to be studied here, us- 
ing the combination of numerical and analytical methods 
described in Section 3, while Section 4 shows the effect of 
biasing CDM-like mass models by these methods. We focus 
on the question of the scalc-dcpcndcncc of the Fourier space 
bias parameter, h{k), defined as the square root of the ratio 
of the galaxy and mass power spectra: 



h{k) ^ \Al{k)/Al{k)]"\ 



(1) 



where Ag(A;) and A^(fc) are, respectively, the galaxy and 
mass power spectra. We employ the dimensionless form of 
the power spectrum, A^(fc), which is defined to be the vari- 
ance per In k [i.e. A^(fc) — du^/d In k oc k^P{k)]. Our results 
indicate that b(k) tends to a constant value on scales where 
the density field is linear, with only a weak tendency to 
change with scale for most reasonable bias prescriptions: we 
shall denote this asymptotic large-scale bias value by 6oo. 

Finally, we attempt to find a combination of a CDM-like 
mass model with a reasonable normalisation and a biasing 
prescription that can reproduce the observed power spec- 
trum of galaxy clustering, while reconciling observed cluster 
M/L ratios with the critical density assumed in Einstein - 
de Sitter models. Details of this exercise and its results are 
given in Section 5, and are discussed in Section 6, where we 
also present the conclusions we draw from them. 



2 EULERIAN BIAS 

Our method is to choose some function to relate the galaxy 

and mass densities 



Pg(x) = /[Pm(x)], 



(2) 



and investigate its biasing properties: we assume that these 
density fields are defined on scales larger than that of in- 
dividual galaxies. This is straightforward where the matter 
density is a continuous function, as is the case if there is 
an analytical model for the density field. The default model 
for the field at early times is a random Gaussian process, 
but this must be modified at late times when the field be- 
comes non-linear. The simplest model which introduces the 
skewness necessary to avoid negative density is a lognor- 
mal density field (Coles & .lones 1991), for which the biased 
correlation function can be calculated analytically in many 
cases. These results are derived in the Appendix, and yield 
simple expressions for the linear bias in the limit of weak 
correlations: we shall quote these below as 

1/2 

(3) 



; lim 

r— i'oo 



4m(r) 



More realistic non-linear distributions are provided by 
A''-body simulations, but here the density field is discrete, 
and must be smoothed in order to yield a continuous field. 

In practice, our approach is to weight the particles in the A'^- 
body simulation, depending on the local density of particles: 



w = /[pm]/Pn 



(4) 



and to identify the resultant weighted particle density with 
the local galaxy density. Galaxies, of course, also form a dis- 
crete set of points, but we assume galaxies to be a Poisson 
sample of some continuous field, whose properties are to be 
estimated: we refer the reader to Coles (1993) for a discus- 
sion of the Poisson clustering model and local bias. 

From the infinity of possible relationships between mass 
and galaxy density, we restrict ourselves to three paramet- 
ric forms which are nevertheless sufficiently flexible to allow 
general conclusions to be drawn. The first two model the 
effects of increasing galaxy numbers in high density regions 
and suppressing them in voids, while the third is a func- 
tional form motivated by the numerical work of CO. The 
functional forms we consider are detailed in the next three 
sections. 

It is important to note that these models quantify the 
relationship between the mass and galaxy distributions at 
redshift zero, rather than solely the process of galaxy forma- 
tion itself: as emphasised by CO and others, the present-day 
distribution of galaxies reflects their movement since birth 
as nmch as the locations of those births. In particular, it 
may be difficult to predict the evolution of bias (Matarrese 
et al. 1997). 

2.1 Power law bias 

The first model we consider is power law bias: 

Pg « Pm, (5) 

i.e. we apply a weight to each mass particle oc p^~^. With 
B > 1 equation (5) represents the enhancement of galaxy 
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numbers in high density regions. In the Appendix we show 
that the large-scale bias paxameter for power law biasing of 
lognormal mass models is simply 6ln = B. 

A reasonable choice for B would be the value required 
to decrease the mass-to-light ratio in clusters by a factor 
of about 5, sufficient to reconcile AI/L measurements with 
an Einstein - de Sitter universe. If we take the centres of 
clusters to correspond to a mass overdensity of ~ 1000, this 
suggests B ~ ln(5000)/ln(1000) = 1.23, which is perhaps 
a surprisingly small degree of non-linearity. This is only a 
rough calculation, since it ignores the fact that the trans- 
formation also scales the mean luminosity density. For the 
lognormal model in the Appendix, this correction can be 
calculated, leading to the slightly higher value of B ~ 1.4. 
This argument is encouraging, since a small degree of bias 
is exactly what is needed on large scales. The linear frac- 
tional rms density contrast in 8 Mpc spheres is about 
0.9, whereas the abundance of rich clusters suggests a figure 
of 0.5 0.6 if n = 1 (White, Efstatliiou & Prenk 1993); a 
Unear bias of about 1.5 is thus required. 



2.2 Censoring bias 

The next prescription we study is censoring, in which we 
assume that galaxies arc not found in regions where the local 
mass density falls below a certain value, pm'- this represents 
both the migration of gala:xies from underdense regions and 
the possibility that the galaxy formation process features 
some density threshold, below which galaxies cannot have 
formed by the present. In the absence of other bias, this is 
the 'weighted bias' model of Catelan et al. (1994), but it may 
readily be combined with power law bias, to yield a model 
where galaxy numbers are enhanced in high density regions 
and simultaneously suppressed in regions of low density: 



Pm if Pm > Pn 



otherwise. 



(6) 



In the lognormal approximation the large-scale bias for this 
model takes the form 



6ln = S -I- ■ 



1 - erf 



\ ^ J 



(7) 



where f is the threshold for censoring in the Gaussian pro- 
cess from which the Eulerian density field is generated by a 
lognormal transformation: 1 + 5ln ~ exp{SG — o /2), where 
5ln and 5g are the fractional overdensities in the lognormal 
and Gaussian field, respectively; cr^ is the variance of the 
latter; and erf denotes the error function. A more meaning- 
ful way to express the threshold is in terms of the fraction, 
/m, of mass below the threshold, related to the Gaussian 
field by 



/m = 0.5 



1-l-erf 



^/2 



(8) 



We shall generally use /m to define density thresholds in this 
paper, although, equivalently, one could use /v, the fraction 
of the volume which is censored (i.e. given zero galaxy den- 
sity), given by 



/v = 0.5 



1-herf 



^/2 



(9) 



In the case of B = 1 equation (7) reduces to the result de- 
rived by Catelan et al. (1994) for their weighted bias model, 
once account has been taken of the presence of an unwanted 
factor of l/\/2 in the defirution of their quantity i/. 

In order to compare the large-scale bias level, &ln, pre- 
dicted for a particular bias scheme by the lognormal model 
with that derived from a numerical simulation, we need to 
estimate . Given that the density field is not exactly log- 
normal, the determination of cP' will not be unique. We es- 
timate it by least-squares fitting to the differential form of 
(8), using (9) to relate the parameter v to the mass density: 
in what follows we show that the efficacy of the lognormal 
model may be judged independent of any problems in esti- 
mating cr^. 



2.3 Cen-Ostriker bias 

The final model we consider is inspired by the work of CO, 
who found a tight pg — pm relation well fitted by the form 



In 



Co Ci In 



-C2 



In 



(10) 



This is a generalisation of power law bias, with the intro- 
duction of the second parameter allowing greater freedom 
for the variation of the pm — Pg relationship: for example, 
with C2 < one may mitigate the effects that a large power 
law bias has in the densest environments. Although Cen & 
Ostrikcr found specific values for Co, Ci and C2, we take 
them to be free parameters in this analysis. The value of 
6ln for the CO models is given by 



Ci — C20 

1 - 2C20-2 



(11) 



as shown in the Appendix. This model has also been stud- 
ied by Little & Weinberg (1994), who investigated the void 
probability function under CO bias. 



3 MODEL GALAXY POWER SPECTRA 

For each of the models considered, we compute a galaxy 
power spectrum from the weighted particle distribution in a 
numerical simulation. The power spectrum is generally still 
biased on the scale of the fundamental mode, so the galaxy 
power spectrum is extended to larger scales by assuming a 
constant bias on larger scales (see Section 3.2). 



3.1 Small scales: numerical simulations 

Our numerical simulations wore produced using the AP3M 
code of Couchman (1991). All simulations were of an Ein- 
stein - dc Sitter universe, and used the CDM transfer func- 
tion of BBKS: 



T(fc): 



ln(l 2.34g) 



2.34g 



[1-1-3.1 



' + (16.1g)' + (5.46g)^ + (6.71q)^] (12) 



where 
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q = k/r*, (13) 

wavenumbers, k, are in units of /iMpc~^, the Hubble con- 
stant is Ho = lOO/i kms~^ Mpc~^, and V* is the shape pa- 
rameter, which for a pure CDM universe with total and 
baryonic density parameters f2o and fie respectively, is well 

fit (Sugiyama 1994) by the form 

r* = no/icxp(-nB - f^B/^o). (14) 

(N.B. our shape parameter, F*, is defined slightly differently 
from the F parameter introduced by Efstathiou, Bond & 
White 1992.) 

We consider F* to be a free parameter, so that equations 
(12) and (13) describe a family of CDM-likc mass power 
spectra, l\^{k) oc k'^T'^{k): wc consider only models with the 
Harrison- Zcldovich (n — 1) primordial power spectrum. 

The form of these equations means that a particular 
output from a CDM-like simulation with a given value of F* 
may bo identified with a different epoch by reinterpreting 
the box side: 

ibox/ /i"^ Mpc oc 1/r*. (15) 

Changing the assumed F* in this way also alters the implied 
value of 178 , so a single simulation output gives results along 
a locus in (F*,cr8) space. Furthermore, different time-step 
outputs may be used to study different loci, so spanning the 
whole family of CDM-Iiko models at the present epoch. One 
cannot use the earliest time-steps, which bear the traces of 
the initial conditions, but this powerful technique can cover 
the interesting part of the (F*,0-8) space of mass models. 
The results in this paper come principally from a simulation 
of a 25/F* Mpc box with 80^ particles. 

Biasing is implemented by weighting particles according 
to their local density. The local density must be calculated 
quite carefully to avoid discreteness problems in low-density 
regions. We divide the simulation box into A^"^ cubic cells of 
side £, setting the density of the particles in a cell according 
to the number of particles in it. If, however, there are fewer 
than five particles in the cell, the cell side is repeatedly dou- 
bled until at least five particles arc found. Essentially the 
assumption in this process is that there is no structure on 
scales smaller than the mean local inter-particle separation. 
Inevitably there are some particles in very large void regions, 
and it is computationally inefficient to extend this procedure 
of doubling cell sizes until all of these arc included in cells 
containing at least five particles. As implemented here, this 
procedure stops when ~ 1 per cent of the particles remain 
in cells with less than five particles: the inaccuracy in the 
assignment of local densities to these few particles in low 
density regions has no bearing on the power spectrum com- 
puted for the weighted particle distribution. 

The degree of smoothing applied to the density field by 
this process clearly depends on the value of A'^. In Fig. 1 we 
show how the level of bias produced by a particular power 
law bias model from Section 4 varies with the value of A'^: 
as one would expect, the small-scale bias increases with N, 
(as less smoothing allows greater local densities, and, thus, 
higher particle weights), until it saturates at a certain value 
of N, where the cell size becomes smaller than the size of 
the most overdense virialized regions, and further reduction 
cannot increase the smoothed densities produced. This satu- 
ration sets in at iV ~ 100, and we choose for the evaluation 




0.1 0.2 0.5 1 2 
k / (h Mpc"^) 

Figure 1. The scale-dependence of bias for power law bias as a 

function of the resolution scale used to obtain the local density: 
[b(k)]'^ is defined to be the ratio of 'galaxy' and mass power spec- 
tra. For a mass model with F* = 0.25, ag = 0.64, we apply power 
law biasing with B = 1.2. The various symbols show the results of 
shrinking the cell used to obtain the small-scale density contrast 
from iV=10 (empty stars) to JV=140 (largest circles). The hori- 
zontal line shows the linear bias value predicted by the lognormal 
model. 

of our local densities a value of iV = 120, corresponding 
to £ = 0.83(0.25/F*) Mpc. For the mass models wc are 
most interested in, with F* ~ 0.25, this is roughly equiv- 
alent to smoothing with a spherical top hat of filter radius 
f?TH — 0.5 Mpc, so that the scale on which we define our 
local density is similar to that on which cluster M/L ratios 
are evaluated, and on which we assume physical bias mech- 
anisms operate. Note that, in Fig. 1 and subsequent figures, 
a numerical artefact means that the power in the longest- 
wavelength bin is slightly incorrect: the magnitude of this 
effect is small enough to have no bearing on our results. 

3.2 Large scales: quasi-linear power spectrum 
extension 

The range of scales which we need to consider is too large 
for present-day A'^-body simulations, so we use the method 
of PD94 to map linear power to non-linear power on large 
scales. The heart of this method, based on that of Hamilton 
ct al. (1991) for evolving the integrated correlation function 
^(r), is the pair of postulates that: (a) gravitational col- 
lapse can be considered as a translation of scales, so that 
wavcrmmbers and ^nl in the linear and non-linear power 
spectrum, respectively, are related by 

fcL = [l-hAL(fcNL)]""''fcNL, (16) 

where ANL(feNL) is the non-linear power spectrum; 
and (b) that there should be a universal mapping 
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Figure 2. The scale-dependence of bias for power law bias as 
a function of the power law index: B = 1.1,1.2,1.3,1.4,1.5 in 
order of increasing bias. A CDM model with (F* = 0.25, (Jg = 
0.64) is assumed. The horizontal lines show the linear bias values 
predicted by the lognormal model. 

AnlC^nl) = /nl [AL(fcL)] between the linear and non- 
linear power spectra. PD94 used knowledge of the asymp- 
totic behaviour of /nl, together with numerical simulation 
data, to determine its full form. A more recent paper (Pea- 
cock & Dodds 1996) takes account of a weak dependence of 
/nl on the linear power spectrum, but the difference between 
the two prescriptions is small on the quasi-linear scales in 
which we are interested. 

To perform this extension, we take the bias parameter 
b{k) to be constant on large scales. This is motivated by the 
numerical results below, which show that b{k) indeed tends 
to a constant at small k. We estimate the constant from the 
smallest three wavenumber bins used in the determination 
of the power spectrum of the weighted particle distribution 
in the numerical simulation, and tests on larger boxes sug- 
gest that this procedure for estimating the asymptotic value 
of b{k) is accurate to ~ 0.1 or better for the bias prescrip- 
tions in which we are most interested. An alternative method 
for setting the large-scale bias would be to use the analytic 
lognormal model, but, as shown in Fig. 2 and subsequent 
figures below, this is not usually sufficiently accurate. 

4 POWER SPECTRA WITH EULERIAN BIAS 
4.1 Power law bias 

In Fig. 2 we show the b{k) curves for a range of B values and 
a single mass model (F* = 0.25, ag = 0.64). As one would 
expect, the b{k) curves steepen as B gets larger: giving more 
weight to over-dense regions increases the overall level of 
bias, as these regions are more strongly clustered than the 
mass distribution as a whole, but this is felt most strongly at 



Figure 3. The scale-dependence of bias for power law bias with 

B =1.2, for CDM models with F* = 0.25 and as varying from 
0.37 to 1.0. The an values plotted arc as follows: 0.37 (empty 
stars), 0.47 (triangles), 0.56 (crosses), 0.64 (stars of David), 0.72 
(filled squares), 0.80 (filled circles), 0.87 (filled stars), 0.93 (empty 
squares), and 1.0 (empty circles). Note that the trend of bias at 
given k is not monotonic with as, and that the maximum bias is 
attained at as = 0.56: this is discussed further in the main text. 
The horizontal line shows the linear bias value predicted by the 
lognormal model. 



high k, which probes clustering power on separations within 
the scale of these regions themselves. The dashed lines show 
the corresponding values of 6ln, which appear to yield a 
slight underestimate of the asymptotic level of bias. 

Figs. 3 & 4 show, respectively, the effect on b{k) of vary- 
ing the amplitude and shape of the linear mass power spec- 
trum. From Fig. 3 we see that b{k) is remarkably insensitive 
to amplitude: the small-scale bias varies by only about 15 
per cent between the earliest and latest epochs, although 
the power in those wavenumber bins increases by more than 
a factor of ten in that time. Detailed inspection of Fig. 3 
shows that b{k) at given k initially increases with increasing 
as to as ~ 0.6, after which the amplitude of the b{k) curve 
begins to fall as as increases further. One would expect b(k) 
to increase with as, since, as as gets larger, more mass be- 
comes concentrated in the densest environments, which in- 
creases the level of bias on all scales, but preferentially on 
small scales, for the reasons given above to explain Fig. 2. 
To understand the reduction in b{k) at large as, consider 
what would happen if the simulation had been left running 
for a long time, until crs 3> 1. In the limit of a very highly 
evolved simulation, essentially all the mass would reside in 
a number of very massive clumps. Applying bias to such a 
mass distribution would have very little effect, so we expect 
b{k) —^ 1, as as — > oo, hence a reduction in b{k) for larger 
as- What this tells us is that the latest epochs in the sim- 
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Figure 4. The scale-dependence of bias for power law bias with 
B =1.2, for CDM models with as ~ 0.85 and F* varying from 
0.175 (highest) to 0.3 (lowest). The horizontal line shows the lin- 
ear bias value predicted by the lognormal model. 



Figure 5. The scale dependence of bias for CO bias with Ci =1.5 

and C'2 varying from -0.4 (lowest) to -0.05 (highest), for a mass 
model with F* = 0.25 and ag = 0.64. The horizontal line shows 
the linear bias value predicted by the lognormal model. 



ulations are starting to become affected by the absence of 
very large-scale power beyond the box scale. 

In Fig. 4 we see a slightly larger spread in b{k), with 
models with lower T* having somewhat larger bias at all 
wavenumbers. Given the form of the transfer function of 
CDM- like models (equations 12 and 13), we see that the 
effect of varying F* at constant as can be thought of as 
shifting the power spectrum in wavenumber, to a model with 
a different degree of evolution, which is lower for the lower F* 
values. Thus, one can imagine producing Fig. 4 by shifting 
the b{k) curves in Fig. 3 to larger k: the value of ag used for 
the models plotted in Fig. 4 is larger than the critical value 
of (78 ~ 0.6, so the Fig. 3 curves one would shift would all 
be flattening ones in the higher as regime. In this way, we 
see that the variation in b{k) with F* shown in Fig.4 reflects 
how concentrated into large, discrete clumps is the mass in 
a CDM-like model at constant as as a function of F* , which 
determines how large an effect is produced by biasing that 
mass distribution. 

Overall, the main features of these plots are that the 
bias is a rnonotonic function of scale and that it varies rel- 
atively slowly as a function of scale, changing from e.g. 1.5 
to 3.0 over a range of scales where the observed A^(fc) 
alters by a factor of more than 300. The slow variation 
of b{k) is in marke^d contrast to the lognormal prediction: 
1 + = {1 + Cm)^ , where and are, respectively, the 
mass and galaxy correlation functions. For B = 1.5, this 
implies an effective bias of 75 at ^ = 1000, which is a far 
more rapid variation than is seen in the A'^-body results. 
Note also that the lognormal model predictions indicated 
by the horizontal lines in Figs. 2-4 always appear to lie be- 
low the asymptotes towards which the numerical 6(fc) curves 
are tending. 



4.2 Cen-Ostriker bias 

The b{k) curves resulting from biasing the mass model of 
Fig. 1 by a series of CO models axe shown in Fig. 5: the value 
of Ci is held constant and C2 is varied, to show the sensi- 
tivity of the model galaxy power spectrum to the weighting 
of the very densest regions. Note that, for C2 < 0, the sup- 
pression of the weight given to the densest regions reduces 
the overall level of bias, so that the large-scale asymptote, 
boo, falls below boo = Ci, and that there is a change in the 
shape of the b{k) curve, making b{k) increase with scale. The 
lognormal predictions are shown on Fig. 5, and capture the 
main effect of varying C2: nevertheless, as with pure power 
law bias, the model does not predict the asymptotic level of 
bias perfectly. 

Coles (1993) showed that local biasing of a density fleld 
which is Gaussian, or a local transformation thereof, will 
produce a real-space bias parameter, b(r), (deflned to be 
the square root of the ratio of the galaxy and mass corre- 
lation functions) which decreases monotonically with scale. 
For CDM-like spectra, the same analysis predicts a mono- 
tonic decrease in b{k) with increasing scale, with the excep- 
tion of the very largest scales {k ^ 0.01 h Mpc~^), where 
the tangent spectral index is ricff > and, consequently, 
fe(fc) can increase unboundedly. The results of Fig. 5 show 
how the Coles (1993) inequalities breaks down once the mass 
distribution becomes sufEciently non-linear; like the failure 
of the lognormal model to predict correctly the large-scale 
asymptotic bias level, this indicates that the evolved mass 
distribution is not well described by a lognormal. 
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4.3 Censoring 

The result of censoring the same mass model is shown in 
Fig. 6, which indicates that the level of bias increases as 
more of the void regions are removed. However, it is only 
when /m ^ 0.6 that the curvature in b{k) begins to increase 
appreciably, showing again how the small-scale power is de- 
termined by the densest regions of space. The same can be 
said for Fig. 7, which shows the same effect in the censor- 
ing of a S = 1.2 power law bias model at various thresholds. 
The dashed lines arc the predictions of the lognormal model, 
which once again produces a fair, but not perfect, estimate 
of the level of large-scale bias. 

Both in the lognormal approximation, and in the nu- 
merical datasets, the effects of censoring seem less extreme 
than might have been expected on the basis of the following 
naive argument. If we make up the mass distribution by the 
addition of an unclustered background to a clustered field, 
then censoring might be expected to correspond roughly to 
the removal of the unclustered part. If we write 



p{l + S) = pc{l + Sc) + Pu, 
then we have 

Sc = [1 + Pu/Pc]S, 



(17) 



(18) 



so that the effective real-space bias is just b — {1 — /m)~^, 
where /m = pu/{Pc + Pu) is the fraction of the mass which 
has been censored. To solve the M/L problem by censoring 
alone, we would need /m — 0.8, implying & = 5 by this ar- 
gument. However, the lognormal model yields only b = 1.56 
for /m = 0.8 (assuming a = 2.5, but only wealdy dependent 
on cr): so, an amount of censoring large enough to have an 
important effect on M/L values thus produces the sort of 
bias factor needed in reality to give a sensible us if f2 = 1. 

4.4 Scale dependence of bias 

One interesting general feature of the bias seen in the numer- 
ical results is the relatively weak scale dependence. Consider 
the prediction in case of power law bias of a lognormal field, 
as given in the Appendix: 



(19) 



This implies a very rapid increase of bias, once the field 
reaches scales where the fluctuations are non-linear. 

Even for the case of power law bias, such a rapid de- 
pendence on scale is not seen in our A^-body data, as il- 
lustrated for a variety of models below in Fig. 8. This plot 
shows a fitting formula for the scale-dependence of bias in 
k space, which is inspired by the lognormal form, but which 
introduces an extra parameter in order to allow pure scale- 
independent bias as one limit: 



1 + = (1 + 6iA^) 



2 \b2 



(20) 



As may be seen from the figure, this form is capable of giving 
an excellent description of the data. For pure power law bias, 
the results scale with B almost exactly as 



(61,62) = (B^■^s°■^), 



(21) 



so the non-linear component of the bias is much smaller than 
the scale- independent component: this is a general feature 
of all the models we have investigated. 
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Figure 6. The scale-dependence of bias for censoring bias with 

/m varying from 0.35 (lowest) to 0.65 (liighcst), for a mass model 
with r* = 0.25 and ag = 0.64. The horizontal lines show the 
lineaj: bias values predicted by the lognormal model. 
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Figure 7. The scale dependence of bias for a combination of 
power law bias with B = 1.2 and censoring with /m varying from 
0.35 (lowest) to 0.65 (highest), for a mass model with P* = 0.25 
and ITS = 0.64. The horizontal lines show the linear bias values 
predicted by the lognormal model. 
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Figure 8. Numerical data for b{k) and model fits. This figure con- 
siders the case F* = 0.25, (Jg = 1 only. The top panel shows pure 
power law bias with B = 1.1,1.2,1.3,1.4,1.5 (solid points and 
full Unes) and CO bias with Ci = 1.2 and C2 = -0.1, -0.4, -0.7 
(open points and dashed lines). The bottom panel shows a com- 
bination of censoring (/m = 0.3, 0.4, 0.5, 0.6) with power law bias: 
S = 0.8 (solid points and full lines) and S = 1.2 (open points 
and dashed lines). 



Notice also that the power law models have a large-scale 
linear bias of 600 = [fei62]'^''^ — B^'^, which is larger than 
the simple relation boo = B expected from the lognormal 
model. The lognormal model therefore fails quite badly as 
a model for the non-linear density field produced in strong 
gravitational non-linear evolution, which should come as no 
surprise: the models studied in the Appendix are invaluable 
as heuristic aids, but should not be applied quantitatively 
to the real Universe. 




Figure 9. A plot of the CfA slice [30° < <5 < 60°; V < 
12000 kms^-"^] (panel a), with a variety of synthetic Af-body 
simulations having the same selection function and numbers of 
'galaxies'. The simulations used n = —1.5 initial conditions, with 
n = 0.3 in panel (b) and $7 = 1 in panel (c). The normalization 
was set at as = 1-0 in (b) and 0.6 in (c). Panels (d) and (e) bias the 
Q = 1 data with respectively B = 1.23 and B = 1.23, /m = 0.1. 
Panel (f) biases the open simulation with /m = 0.1. It appears 
that censoring is needed in both cases to obtain sufficiently empty 
voids. 



5 COMPARISON OF CDM-LIKE MODELS 
WITH OBSERVED GALAXY CLUSTERING 

We now attempt to use some of the above results to investi- 
gate which models are capable of accounting for the observed 
statistics of galaxy clustering. This can be approached at a 
number of levels, of which the simplest is the study of the 
fluctuation spectrum. We shall mainly restrict ourselves to 
this question here, but it may be useful to begin by illus- 
trating in more detail the effect of our bias models on the 
simulated galaxy density field. 

Fig. 9 compares the true local galaxy distribution with 
various models. In panel (a) we show the local {V < 
1200 kms"'^) distribution of galaxies from the CfA slice 
(Huchra et al. 1992, with more recent updates), while sim- 
ulations (b) and (c) show unbiased fl — 0.3 and Q = 1 
universes, respectively, for a mass model whose linear power 
spectrum has a slope (n = —1.5) roughly equaling that of 
CDM on galactic scales and a normalization set by the clus- 
ter abundances. The clearest feature of these examples is an 
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insufficient contrast between high density regions and voids. 
Panel (d) appHes power law bias with B = 1.23 to the = 1 
case, but this improves the situation only slightly. Panel (e) 
adds censoring with /m ~ 0.1, and this 'opens out' the voids 
to roughly the correct extent. Finally, panel (f) applies the 
same censoring to the low density model, again with reason- 
able results at the visual level. We thus believe that our bias 
models do capture the main essentials of the modifications 
needed in order to make dark matter distributions resemble 
the real distribution of galaxies. For the remainder of the 
present paper, we shall be content to restrict ourselves to 
comparison of data and model at the two-point level. 



5.1 Observational data 

The observational data we use for comparison with the mod- 
els are: 

(i) Large-scale normalisation: the four-year COBE data 
yield Qrms-ps = 1?>A^K (e.g. Tegmark 1997). We allow a 
conservative 10 per cent uncertainty in that figure. 

(ii) Intermediate scale normalisation: as = 0.57 ± 0.05 
from the abundance of rich clusters (White et al. 1993). We 
consider models with 0.5 < ag < 0.7. 

(iii) Galaxy clustering: real-space APM galaxy power 
spectrum (Baugh & Efstathiou 1993) over the wavenumber 
range -1.5 < logio(fc/ /iMpc"^) < 0.5. 

The galaxy clustering range chosen spans the apparent 

point of inflection in the APM power spectrum, including 
data on the linear scales studied by PD94, as well as the 
non-linear regime omitted from their analysis. We have not 
used the largest scales because of possible uncertainties in 
surface uniformity of the survey, and have multiplied the 
power spectrum of Baugh & Efstathiou (1993) by 1.25 to 
account for the non-zero median redshift of the APM survey: 
see Peacock (1997) for details of that correction. 

For models which satisfy the above constraints, we 
investigate whether they can give sufficiently large mass- 
to-light ratios in clusters. In rough terms, we require 
Pg/pg ~ 5000 in regions where pm/pm — 1000. One can ar- 
gue in detail about these numbers, but they should be ap- 
proximately correct for cluster masses and sizes: in fact, as 
we shall see, no model gets close to the required galaxy den- 
sity. 

We note in passing that we find that no V* model can 
be linearly biased [i.e. Ag = 6^(/c)A^, with h{k) constant] in 
such a way as to satisfy these three constraints. The prob- 
lem lies with the shapes of the evolved mass power spec- 
tra of CDM-like models, which do not match that of the 
APM power spectrum, implying that to bias successfully a 
r* model will require a non-linear b{k), and thus motivat- 
ing our study of non-linear bias prescriptions. In any case, 
it is not clear how a linear bias may be achieved from a 
local transformation of a highly non-linear density field, al- 
though numerical studies involving detailed prescriptions for 
galaxy formation have shown a nearly linear bias (Kauff- 
mann, Nusser & Steinmetz 1997). 



5.2 Results 
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Figure 10. Contours of between power law biased CDM power 

spectra and the APM data: the contours arc at Ax^ = 15, with 
= 30 being the lowest contour sliown, near the minimum at 
r* ~ 0.17, o"8 ~ 1.0. Tlie horizontal hues show the allowed range 
of (Tg from the cluster abundance constraint The diagonal lines 
show the constraint required to satisfy the COBE measurement 
for the case of no tilt. 



5.2.1 Power law bias 

In Fig. 10 we show contours of for fits of power law bi- 
ased CDM-like models to the APM data. There is a clear 
preferred model, with P* = 0.175, trs = 1.03, S = 0.86 and 
= 14.5, for 13 degrees of freedom: the power spectra are 
compared at 14 wavenumbers, so there are 13 degrees of 
freedom in the power law bias fits and 12 in the {B, /m) 
and (Ci,C2) fits. This model is however firmly excluded 
both by the cluster abundance constraint (horizontal lines) 
and COBE (angled lines) if the primordial spectral index 
is n = 1. This latter constraint is removed if the spectrum 
is tilted, but even so the cluster abundance difficulty re- 
mains. If we insist on a normalization of trs — 0.6, to fit the 
cluster abundance normalisation, the preferred value of P* 
is unchanged at 0.175, B increases to 1.1 and ^ increases 
sharply to 43.3. These models are compared to the APM 
data in Fig. 11. Neither alternative fully reproduces the ob- 
served data, especially those below the inflection around 
fc ~ 0.1 /iMpc^^, but they exploit the errors in order to 
reduce the discrepancy to the point where the overall is 
moderate. 

The best-fitting values of B for the models of Fig. 10 
are plotted in Fig. 12. The contour lines (for B <; 0.8) are 
very close to being lines of constant erg. Since ag is related 
roughly to a scale k ~ 0.3 /iMpc"'^, which is close to the 
asymptotic region where 6(fe) is constant, this indicates that 
the most important effect of the bias is in setting the nor- 
malisation, rather than the detailed shape, of the galaxy 
power spectrum: this bears out the results given in equation 
(21). Note that Fig. 10 shows that x^ decreases slightly as 
(Ts falls below 0.4 for P* = 0.15 - 0.20, and, from Fig. 12, 
the best fit B values _B ~ 1.5 are much higher than those 
near the global x^ minirrmni. This indicates how, given the 
linear power spectrum of a particular mass model, there are 
two ways to try to achieve the steepening required to match 
the APM power spectrum: use a high bias on a moderately 
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Figure 11. A comparison between the APM data (filled circles) 

and two power law biased CDM models: the stars show F* ~ 
0.17, (78 = 1.0, B = 0.85, and the triangles show V ~ 0.17, eg = 
0.55, B = 1.3. 





1 1 1 1 1 1 1 1 










. - 0.90 






=^ 1.10 




1.10 












1.30 




1 . . . -r 





0.15 0.20 0.25 0.30 

r* 



Figure 12. The best-fitting value of B in power law bias models, 
as a function of (r*,cr8). The horizontal lines show the allowed 
range of its from the cluster abundance constraint. The diagonal 
lines show the constraint required to satisfy the COBE measure- 
ment for the case of no tilt. 



Figure 13. Contours of ^ ^ function of the CO parameters 
Ci & C2 in the comparison between biased CDM power spectra 
and the APM data. The contours are at A(logiQ x^) = 0.3 above 

X2 = l. 

for these mass models has a very characteristic form, as il- 
lustrated by Fig. 13, which shows it for one such model, with 
(r* = 0.25 and as = 0.64). In all cases the contours form 
a wedge-shaped valley in the {C\,C2) plane, the gross fear- 
tures of which are readily explained. The introduction of the 
C2 term into the bias formula allows these mass models to 
improve agreement with the APM power spectrum by main- 
taining their large-scale power (by increasing Ci above the 
value of the best-fit B value) in the face of its suppression 
by the negative C2 value which is needed so as not to pro- 
duce too much small-scale power. The floor of the valley in 
the surface gently rises with increasing Ci, but wo need 
not concern ourselves with whether or not models matching 
the APM power spectrum are to be found at large Ci , since 
the increasingly negative value of C2 required to make that 
match very rapidly reduces the predicted cluster M/ L ratio 
to very low values. In fact, acceptable x^ values are rarely 
found above Ci = 2, and the negative C2 values required 
by acceptable models means that their mass-to-light ratios 
fall short by factors of several hundred from that required 
to reconcile observed cluster M/L values with O = 1. We 
therefore exclude all CDM-like models biased according to 
the Cen-Ostriker formula. 



evolved mass distribution, or evolve the mass distribution 
much more and use anti-bias {B < 1) to pull down the 
power to the desired level. 

5.2.2 Cen-Ostriker Mas 

For the remainder of our analysis wo restrict our attention 
to the region of the (r*,cr8) plane which satisfies both our 
normalisation criteria. The form of the x^(C'i)C'2) surface 



5.2.3 Censored models 

In Fig. 14 we plot x^i-Bjfu) for a particular allowed mass 
model (r* = 0.25, o"8 = 0.64 again). The basic form of this 
curve is, again, characteristic and easy to understand: as 
more and more weakly-clustered mass is censored, a lower 
value of B is required to match the APM power spectrum 
to a given accuracy. Typically, the best-fit censored power 
law model has /m — 0.35— 0.45 and a value of B lower than 
that for the best pure power law bias model. Those censored 
power law bias models that give a good fit to the APM power 
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1.2 



Figure 14. Contours of for the bias model witli power law 
bias and censoring, as a function of the parameters B and fm, 
in the comparison between biased CDM power spectra and tiic 
APM data. The contours are Ax^ = 15, with the inner contour 
being = 30. 



spectrum may bo excluded on M/L ratio grounds, as is clear 
from Fig. 14, which shows that increasing /m ordy leads to 
lower values of B: e.g. censoring half the mass increases the 
mass-to-light ratio by a factor of two, but the low value of 
B reduces it by a very much larger factor. 



5.3 Comparison of bias prescriptions 

An enlightening comparison of the effects of the three bias 
prescriptions is provided by Figs. 15 and 16. In Fig. 15, the 
filled circles show the APM galaxy power spectrum, while 
the stars of David show the mass power spectrum for a model 
with r* = 0.25 and ag = 0.64. Clearly, to match the APM 
power spectrum, this mass model needs a significant bias 
on large scales, below the inflection in the APM power spec- 
trum. However, the required level of bias falls with increasing 
wavenumber, and there is little room for bias on the smallest 
scales. The best fit power law bigis model yields = 96.2 at 
B = 1.07, and its power spectrum is shown by the stars in 
Fig. 15, where it is clear that only a modest power law bias 
produces too much small-scale power, while falling short of 
the desired clustering strength on large scales. The freedom 
to censor mass produces the power spectrum given by the 
triangles in Fig. 15, which has B = 0.93 and /m = 0.38 
(giving = 62.3), while the best fit Cen-Ostriker model is 
Ci = 2.0, C2 = -0.19, (crosses), which yields = 27.1. 
The different effects of these three models arc better shown 
by their b(k) curves, which are plotted in Fig. 16. Censoring 
almost 40 per cent of the mass produces the required in- 
crease in large-scale power, but B < 1 is required to restrict 
the level of small-scale clustering, which still exceeds that 
of APM galaxies. The great freedom afforded by the intro- 
duction of the C2 parameter is shown by the much larger 
reduction it allows, as well as by the shape of the h{k) curve 
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Figure 15. Biasing a mass model to fit the APM power spectrum. 
The mass power spectrum for a model with T* = 0.25 and its = 

0.64 is shown by the stars of David, while the filled circles plot 
the APM power spectrum. The best fit models from the tfirec 
bias prescriptions are shown as follows: (a) power law bias with 
B = 1.07 (stars); (b) censored power law bias with B = 0.93 and 
/m = 0.38 (triangles); and (c) Cen-Ostriker bias with Ci = 2.0 
and C2 = —0.19 (crosses). 



it produces in Fig. 16: a much better fit to the APM power 

spectrum is obtained, as equation (10) allows a substantial 
large-scale bias at the same time as keeping the small-scale 
clustering power pinned, rising only slighty above the level 
of that found in the mass distribution. The problem with 
this model, of course, is that it fails to provide a sufficiently 
large boost to M/L on scales of clusters. There appears to be 
no way in which we can achieve the necessary b{k) without 
violating this constraint. 



6 DISCUSSION AND CONCLUSIONS 

We have investigated the effect of local biasing on the clus- 
tering statistics of galaxies. In general, we find that local 
biasing gives rise to a scale-dependent bias, so the shape of 

galaxy power spectrum no longer directly reflects the under- 
lying mass distribution. However, the change in b{k) with 
scale is often rather modest: if boo — 1-5, as suggested by 
observations and COBE normalisation, then the models we 
have investigated would only predict at most fe(fc) ~ 3 at 
fe ~ 1 /iMpc~^. The scale dependence of the bias appears 
to be monotonic, but b{k) does not decrease with scale in 
all cases, showing how the inequalities of Coles (1993) break 
down once the density field is sufficiently non-linear that 
it is no longer well described by a Gaussian field, or a lo- 
cal transformation thereof. The b{k) curves produced by our 
biasing prescriptions are also quite smooth, so it would ap- 
pear that any model capable of producing sharp changes in 



© 0000 RAS, MNRAS 000, 000-000 



12 R.G. Mann, J. A. Peacock and A.F. Heavens 



cv 

CO 
CD 

cv 



- 

- 




1 




1 






1 






- 
- 




A 


A 


A 


A 


A 


A 


A 

A 


A 


A 


- 


* 




























it 




* 


* 


* 


* 


ik 





0.1 0.2 0.5 1 2 

k / (h Mpc"^) 



Figure 16. The scale dependence of the bias in the three galaxy 
power spectra sliown in Fig. 15: (a) power law bias with B = 1.07 
(stars); (b) censored power law bias with B = 0.93 and /m = 0.38 
(triangles); and (c) Cen-Ostriker bias with Ci = 2.0 and C2 = 
—0.19 (crosses). 

galaxy power with respect to that of the mass would have 
to involve non-local bias. 

The scale dependence of bias clearly presents difficulties 
in the interpretation of observed galaxy clustering, by allow- 
ing a wider range of mass power spectra to map onto the 
galaxy power spectrum data through the use of a suitably 
curved h{k). Interpretation of even these models is simplified 
on large scales, however, where the bias tends to a constant 
value. 

Wc have looked at a range of galaxy clustering models 
in the context of an Einstein - de Sitter universe, to see if 
any of these models can simultaneously fit the APM power 
spectrum and the observed mass-to-light ratios in clusters 
of galaxies. The strong conclusion we reach is that COBE- 
normaliscd CDM-likc models with fixed as required by 
the observed abundances of clusters do not succeed, given 
the biasing functions we have explored. In addition, the 
models fail to reproduce the required mass-to-light ratios by 
large factors, so if any successful bias function were found, 
it would probably be rather pathological. 

There are several ways to improve the agreement of the 
models with observation. One is to lower the density below 
Q = 1, thus reducing or removing the constraints on our 
models in the high-density regions. The lower Q has the ef- 
fect of increasing the required as from cluster abundances, 
but the non-linear effects on the mass spectrum depend on 
Q only on scales smaller than those we have considered here 
(Peacock &: Dodds 1996). The second is to change the shape 
of the underlying power spectrum to one with a sharper 
break than CDM, such as mixed dark matter (e.g. Taylor & 
Rowan-Robinson 1992; van Dalen & Schaeffer 1992; Klypin 
et al. 1993). This latter possibility is the only existing model 



which can explain the full inflection in the APM power spec- 
trum around k ~ 0.1 /iMpc~^, since CDM-like models pro- 
duce a much smoother variation at this point. 

The conclusions we draw from our results are thus as 
follows: 

(i) No linearly-biased CDM-like model can reproduce the 
APM power spectrum: a non-linear b{k) curve is required 
if the cosmological fluctuations spectrum is one of the 
r* models given by equations (12) and (13). Analyses 
(such as that of PD94) which do allow a linearly-biased 
r* model can do so only because they ignore clustering 
data on non-linear scales, which greatly weakens their 
power to reject models. 

(ii) Non-linear bias prescriptions gcncrically produce a 
scale-dependent bias, but it is difficult to produce a 
strongly curved b{k) relation with Eulerian bias, so a 
confirmation of the APM inflection would be strong evi- 
dence for some feature in the primordial linear spectrum 
at that scale or for non-local bias in galaxy clustering. 

(iii) Analytical models based on a lognormal mass distri- 
bution are useful heuristics for gauging the rough level 
of the large-scale bias produced by a given prescription, 
but they are not sufficiently accurate to replace numer- 
ical simulations. 

(iv) No Einstein - de Sitter CDM-likc mass model can be 
biased, using the range of bias prescriptions studied here, 
to match the APM power spectrum if it is required to 
be COBB-normaliscd and account for the abundances 
and M/L values of clusters of galaxies: if these condi- 
tions are to be satisfied using the bias models considered 
here, then the mass power spectrum must be flatter than 
CDM-like models on small scales. 
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APPENDIX A: BIASED LOGNORMAL 
DENSITY FIELDS 

The lognormal density field is a convenient analytical model 
in which the effects of various simple Eulerian density trans- 
formations can be calculated quite simply. The non-linear 
lognormal density field is produced by taking the exponen- 
tial of a Gaussian density field: 



1 + ^LN = exp(^G)- 



(A3) 



1 + 5ln = exp ((5g — cr^/2) 



(Al) 



The variance of the Gaussian field is and the second term 
in the exponential is a normalising factor, so that {5ln) = 0. 
Although this transformation resembles a form of bias, and 
has been discussed in that context, it is important to be clear 
that this is not what is envisaged here. The lognormal field is 
a model for the mass density field: it is the simplest extension 
of the Gaussian random field which is physical in the sense 
of forbidding negative densities. The lognormal field differs 
from its generating Gaussian only when a is significantly 
larger than zero, so the lognormal transformation should be 
thought of as a means of simulating the effects of non-linear 
gravitational instability. 

Al Power law bias 

The simplest modification of a density field is a power law 



p oc oc exp(_B(5G). 



(A2) 



The effects of this modification are easily deduced, since 
Coles & Jones (1991) showed that 



The power law model is just the original lognormal model 
with the generating Gaussian scaled by a factor B, and so 



1 + ?PLN = exp(B'eG) = (1 + Cl 



(A4) 



Interestingly, this modification does not increase the skew- 
ness of the density field. We define the skewness parameter 
(not the skewness itself) as 



S-. 



Integrating over the Gaussian field gives 



(A5) 



(A6) 



Since the observed density rms in the linear regime is just 
Ba, this says that the observed skewness- variance relation is 
unaltered by bias, which masquerades as extra non-linearity 
in the mass field. This degeneracy between bias and evo- 
lution is a feature of the lognormal model. Genuine gravi- 
tational evolution leads to different shaped structures (flat- 
tened on first collapse), and bias and evolution can be sep- 
arated by studying shape-dependent statistics such as the 
bispectrum or 3-point correlation function (e.g. Matarrese, 
Verde & Heavens 1997). 



A2 Censoring bias 

A power law provides a significant boost in density at large 
over-densities, but it is also plausible that the galaxy density 
may suffer abrupt truncation at small over-densities, i.e. in 
the voids. 

In the case of no other bias, this is the model studied 
by Catelan et al. (1994). The method of evaluation is the 
same in all such cases: write S' — f{5), where it is assumed 
that the function / is normalised to force {S') — 0, and use 



1 + ^' = {{1 + S[){1 + S',)). 
The correlation function is then 



(A7) 



{l + fi)il + f2)P{5i,52)dSid52, (A8) 



where the Gaussian joint distribution involves the correla- 
tion coefficient r = ^/(t^. For the case of truncated power 
law bias, the modified density field is 

explBSa - BVV2] 



1 + S' = 



0.5 



erf 



{ v-Ba \ 

\ ^ J 



(A9) 



where the threshold is specified in terms of its value in the 
Gaussian field that generates the mass: 5' = —1 for So < va. 

For the case of censored power law bias, the correlation 
function reduces to the ID integral 

1 + C'=\/2/V- 



-S-^o-^{l + r^)/2 



[l-eT:i{u- B(J)/^/2Y 



1 -erf 



i' — Ba-\-r Ba — rx 
V2{1-'-^) 



da;. (AlO) 



In the ^ ^ 1 regime, this reduces to ^' = b\j<fr(j'^ , with 
the linear bias parameter 
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bi,N = B + 



1 - erf 



/ iy-Ba \ 



(All) 



which reduces to the result of Catelan et al. (1994) for B = 1, 
once allowance is made for a spurious factor of 1/ \/2 in their 
equation (19). 

The results for the case of censoring (with or without 
power law bias) are hardly transparent, and it may be help- 
ful to illustrate them in one particular case. Consider the 
following model for the galaxy correlation function, shown 
in Fig. Al: 



(A12) 



For the case of pure power law bias, it is simple to recover 
the corresponding mass correlation function, and to obtain 
from this the correlation function for the Gaussian process 
which would generate the mass field as a lognormal process. 
We shall consider the specific case B = 1.23. We also take 
Cmax = 10*, giving a = 2.47, although this is not critical. 

We now compare the action of censoring bias on the 
same mass field. To achieve the same linear bias of 1.23 
requires a threshold u = 2.1, corresponding to censoring 
98 per cent of the volume and 36 per cent of the mass. 
As a perhaps more realistic compromise model, we consider 
censoring and power law bias in combination. A threshold 
of u = 1.3 censors 90 per cent of the volume and 12 per cent 
of the mass, and requires B = 1.18 for the same linear bias. 
These alternatives are compared in Figure Al. We see that 
censoring alone steepens the correlation function, but not to 
the same extent as power law bias with the same large-scale 
bias. An excellent fit (not shown, as it is hard to see any 
deviation) is obtained with the two parameter formula 

1-he' = (l-FfeiO*", (A13) 

which is of course a perfect fit in the power law only case. 
For censoring only, (6i, 62) ~ (1.35, 1.12) is needed, changing 
to (1.09, 1.39) in the combination case. 

Although this is only one example, the general be- 
haviour follows this trend: for a given linear bias, censoring 
is less effective at steepening the correlation function than 
is power law bias. To increase the small-scale correlations 
by more than a factor ~ 3 requires very severe thresholding 
in which galaxies only form in the very densest of environ- 
ments. 



A3 Cen-Ostriker bias 

A final example which is amenable to analytical treatment is 
the case considered by Cen & Ostriker (1992), where there is 
a quadratic relationship between the logarithms of the light 
and mass densities: 

ln(pg/pg) = Co + Ci hi{pm/pm) + C2[ln(pm/pm)]^ (A14) 

Note that for C2 = 0, this is just the earlier power law model, 
and that the maximum value for C2 is l/2a^, as the mean 
density diverges if that limited is exceeded. If the density 
field is lognormal, then the properly normalized version of 

this becomes 




.w- o _ 



Figure Al. A plot of various correlation functions. The dotted 
line shows S, for the mass, set so that pure power law bias with 
B = 1.23 produces ^' which is a reasonable match to observed 
galaxy clustering (7 = 1.7, with clustering extending up to ^ = 
10*). The three models shown all have the same linear 600 = 1.23. 
In order of increasing they are (1) censoring only, rejecting 36 
per cent of the mass; (2) censoring 12 per cent of the mass and 
power law bias with B = 1.18; (3) B=1.23 with no censoring. 



X exp 



2(1 - 2C2Cr2) 

The integral for ^' can be performed in this case, giving 



(A15) 



1 + e' 



(1 - 2C2Cr")Vr 



exp 



V'[l-2C2Cr2(l-r2)]2-r2 

(Ci - ^2^2) VV 



1 - 2C20-2(2 + r)+ 4C|<t4(1 + r) 
This has the linear bias 



6ln = 



Ci — C20" 



(A16) 



(A17) 



1 - 2(72^2 ' 

which gives the power law case when C2 = 0. However, we 

also see that the bias is weakened when C2 is negative, so 
that the light density is suppressed at extreme overdensity: 
the linear bias is unity when 

C2 = C2.cHt = -(Ci - 1)/CT^ (A18) 

This paper has been produced using the Royal Astronomical 
Society/Blackwell Science MgX style file. 



1 + <5' = a/i - 2C20-2 
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